Modeling Reduced Pronunciations in German

نویسندگان

Martine Adda-Decker

Lori Lamel

چکیده

This paper deals with pronunciation modeling for automatic speech recognition in German with a special focus on reduced pronunciations. Starting with our 65k full form pronunciation dictionary we have experimented with different phone sets for pronunciation modeling. For each phone set, different lexica have been derived using mapping rules for unstressed syllables, where schwa-vowel+[l n m] are replaced by syllabic [l n m]. The different pronunciation dictionaries are used both for acoustic model training and during recognition. Speech corpora consisted of television programmes, which contain signal segments of a varying acoustic and linguistic nature. The speech is produced by a wide variety of speakers, with linguistic styles ranging from prepared to spontaneous speech and with changing background and channel conditions. Experiments were carried out using 4 news programmes and documentaries lasting more than 15 minutes each (with a total of 1h20min). Word error rates obtained vary between 19 and 29%, depending on the programme and the system configuration. Only small differences in recognition rates were measured for the different experimental setups, with slightly better results obtained by the reduced lexica. 146 Adda-Decker & Lamel

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating text normalization and pronunciation variants for German broadcast transcription

In this paper we describe our ongoing work concerning lexical modeling in the LIMSI broadcast transcription system for German. Lexical decomposition is investigated with a twofold goal: lexical coverage optimization and improved letter-to-sound conversion. A set of about 450 decompounding rules, developed using statistics from a 300M word corpus, reduces the OOV rate from 4.5% to 4.0% on a 30k ...

متن کامل

How are words reduced in spontaneous speech?

Words are reduced in spontaneous speech. If reductions are constrained by functional (i.e., perception and production) constraints, they should not be arbitrary. This hypothesis was tested by examing the pronunciations of highto mid-frequency words in a Dutch and a German spontaneous speech corpus. In logistic-regression models the "reduction likelihood" of a phoneme was predicted by fixed-effe...

متن کامل

Using acoustic models to choose pronunciation variations for synthetic voices

Within-speaker pronunciation variation is a well-known phenomenon; however, attempting to capture and predict a speaker's choice of pronunciations has been mostly overlooked in the field of speech synthesis. We propose a method to utilize acoustic modeling techniques from speech recognition in order to detect a speaker's choice between full and reduced pronunciations.

متن کامل

Automatic detection of anglicisms for the pronunciation dictionary generation: a case study on our German IT corpus

With the globalization more and more words from other languages come into a language without assimilation to the phonetic system of the new language. To economically build up lexical resources with automatic or semi-automatic methods, it is important to detect and treat them separately. Due to the strong increase of Anglicisms, especially from the IT domain, we developed features for their auto...

متن کامل

A Neurophysiological Investigation of Non-native Phoneme Perception by Dutch and German Listeners

The Mismatch Negativity (MMN) response has often been used to measure memory traces for phonological representations and to show effects of long-term native language (L1) experience on neural organization. We know little about whether phonological representations of non-native (L2) phonemes are modulated by experience with distinct non-native accents. We used MMN to examine effects of experienc...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2001

Modeling Reduced Pronunciations in German

نویسندگان

چکیده

منابع مشابه

Investigating text normalization and pronunciation variants for German broadcast transcription

How are words reduced in spontaneous speech?

Using acoustic models to choose pronunciation variations for synthetic voices

Automatic detection of anglicisms for the pronunciation dictionary generation: a case study on our German IT corpus

A Neurophysiological Investigation of Non-native Phoneme Perception by Dutch and German Listeners

عنوان ژورنال:

اشتراک گذاری